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1 Introduction 

The central step in the origin of life the 
emergence of organized structures from organic 
molecules available on the early earth. ! hese 
predecessors to modem cells, called “proto- 
cells, n were simple, membrane-bounded tinc- 
tures able to maintain themselves, glow, di- 
vide and evolve. Since there is no fossil ? ' cord 
of these earliest of life forms, it is a scientific 
challenge to discover plausible mechanisms for 
how these entities formed and functioned. To 
meet this challenge, it is essential to - reate 
laboratory models of protocells that capture 
the main attributes associated with livinsr sys- 
tems, while remaining consistent with known, 
or inferred, protobiological conditions, i hese 
model protocells could be considered as funda- 
mental “proofs of concept” for our under*? end- 
ing of the origin of life. 

Unfortunately, a comprehensive s*> uino 
describing the transition from inanimate mat- 
ter into the simplest protocell is still lacking. 
However, several progressively more sophisti- 
cated projects have been initiated to probe pro- 
tocellular life. Their ultimate go al is ic cre- 
ate a “minimal protocell” [l], which can be 
defined as a structure bound by memt.' uie- 
like material, capable of capturing enerk 1 and 
chemical substrates from the environmei • and 
using them to support the chemical re*- ons 
(metabolism) needed for self- main tenant -r and 
growth. A community of such model pror ells 
should be capable of evolving towards f ^her 
complexity. 

Currently, most projects aimed at ier- 
standing the emergence of protocells fot s on 
self-replication. Nucleic acids [2, 3], proteins 
[4, 5], membranes [6] and non- bio logic/' .! or- 
ganic molecules [7] have been used as elf- 
replicating units. In some instances, the f >lu- 
tionary potential of the self-replicating $;► tern 
was also examined. Even though none of ; ese 
projects will yield a true protocell, they v.l in- 
volve highly complicated processes. To ,v ' >m- 


plish these processes, it is usually necessary to 
use molecular components that, though sim- 
ple by biological standards, are too complex to 
have existed in a protobiological milieu. While 
this type of approximation is undesirable, the 
results of these studies are still of great value 
because they shed light on principles underly- 
ing the fundamental processes involved in the 
emergence of life. 

In this brief overview, we report on 
progress in another project aimed at under- 
standing how protocells worked and evolved. 
This project is a collaborative effort between 
scientists from NAS A- Ames Research Cen- 
ter and the laboratories of J. Szostak (Har- 
vard Medical School), J. Lanyi (University of 
California, Irvine) and D. Deamer (Univer- 
sity of California, Santa Cruz). It is dis- 
tinguished from other projects by its focus 
on protocellular metabolism and the coupling 
of metabolism to energy transduction, rather 
than on self- replication. This research is 

founded on the assumption that the emergence 
of systems endowed with genomes and capa- 
ble of Darwinian evolution was preceded by a 
pre-genomic phase, in which protocells func- 
tioned and evolved using mostly proteins, with- 
out self-replicating nucleic acids RNA). 

This view is motivated by findings that, while 
the building blocks of proteins and simple cell 
walls are relatively abundant in space and can 
be readily generated in experiments simulating 
conditions on the pre-biotic earth, the build- 
ing blocks of nucleic acids are quite rare un- 
der these conditions. Furthermore, RNA can- 
not readily perform such essential cellular func- 
tions a s energy capture and transport. Finally, 
since there is no relationship between the func- 
tion of a catalytic RNA and the function, if 
any, of the protein for which it can code, there 
is no clear path from the “RNA World” to to- 
day’s world of protein catalysis and nucleic acid 
information storage. 
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Figure 1: Schematic of the coupled BR-ATPase energy transduction system. 


2 Protocellular Membranes 

It is hypothesized that first protoceils were en- 
capsulated in vesicles — closed, spheroidal as- 
semblies of amphiphilic molecules enclosing an 
aqueous medium. Contemporary cell mem- 
branes and most model vesicles (liposomes) 
are typically composed of phospholipids. How- 
ever, in the absence of primitive biosynthetic 
pathways yielding phospholipids, it is unlikely 
that these molecules were present on the early 
earth. Instead, the most likely candidates 
for protocell-forming material were monocar- 
boxylic acids and alcohols with chain lengths 
sufficient to stabilize self- assembled structures. 
Recent studies have shown that under nar- 
rowly defined conditions of pH, temperature 
and concentration, a monocarboxylic acid as 
short as nine carbons in length was able to 
assemble into vesicular membranes. However, 
the addition of small amounts of an alcohol, 
nonanol, markedly stabilized the bilayers, so 
that vesicles were present at any pH above 
7 and at lower concentrations of carboxylic 
acid (D. Deamer, unpublished results). Signif- 
icantly, monocarboxylic acids and alcohols are 
also the main components of the amphiphilic 
material extracted from the Murchison mete- 
orite and shown to form vesicles [8]. We con- 
clude that membranous vesicles produced by 
mixed short-chain monocarboxylic acids and 
alcohols are plausible models for early mem- 
branes. 


3 Energy Transduction 

One of the essential functions of protocells was 
to capture energy from the environment and 
utilize it to synthesize the high-energy com- 
pounds that served as the source of energy to 
drive other protocellular reactions. The ini- 
tial source of environmental energy may have 
been light or, more likely, chemical energy (for 
a recent review see [9]). All known mod- 
em organisms transduce environmental energy 
by creating a transmembrane proton gradi- 
ent which is coupled to the production of 
adenosine triphosphate (ATP) from adenosine 
diphosphate (ADP) and inorganic phosphate. 
The synthesis of ATP is performed by mem- 
brane proteins called ATP synthases (ATP- 
ases). Although phylogenetic studies indicate 
that ATPases must have emerged before the 
last common ancestor [10], these molecules are 
too complex to be a part of an early protocell, 
and the nature of the earliest energy transduc- 
tion system is unknown. 

To begin the study of the coupling of en- 
ergy transduction to metabolism in protocells, 
a simple, model transduction system was con- 
structed. It consists of two proteins, bacteri- 
orhodopsin (BR) and the FoFi ATPase from 
the thermophile Bacillus PSS , embedded in li- 
posomes. BR is a light-driven proton pump, 
which generates the transmembrane proton 
gradient needed for the ATP-making activity 
of the ATPase. Although this system is most 
likely not protobiological, it is known to have 
high turnover and stability [11]. 
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To construct the system, purple mem- to external sources of energy. The next objec- 
branes containing wild type BR were solubi- tive is to simplify the components of the system 
lized and incubated with preformed liposomes and direct them to match better our ultimate 
in the presence of detergents. When illumi- goal — the creation of a protein-based model 
nated, the proteoliposomes demonstrated very of primitive metabolism. 


active proton transport directed inward, im- 
plying that the preferential orientation of the 
pumping species was inside-out. Alkalinization 
of unbuffered medium as large as 1 pH unit was 
observed. Next, BR was co-reconstituted with 
a thermostable ATPase from Bacillus PS3 into 
liposomes using a similar procedure as for the 
BR-only liposomes. The ATPase incorporated 
into the membrane with its Fi unit pointing 
outward, making it possible to utilize the pH 
gradient created by BR upon illumination (see 
Fig. 1). The system was stable under illumina- 
tion for several hours (J. Lanyi and L. Brown, 
unpublished results). 

It was further demonstrated that the pro- 
duction of ATP in liposomes could be coupled 
to a model, metabolic chemical reaction. As 
the model reaction, the production of acetyl- 
coenzyme A (acetyl CoA) from acetate and 
coenzyme A was chosen. This reaction is 
catalyzed by a thermophilic acetyl-CoA syn- 
thetase from Pyrococcus furiosus , which uti- 
lizes ATP, but not ADP to make acetyl-CoA. 
After adding this enzyme and appropriate sub- 
strates to liposomes, net synthesis of acetyl- 
CoA upon illumination was observed (J. Lanyi 
and L. Brown, unpublished results). 

The next step will be to use the same en- 
zyme, which is known to have broad specificity, 
to form different long-chain acyl- Co As using 
fatty acids as substrates. Then, acyl-CoA 
transferase will add the fatty acid chain to lyso- 
phosphatylcholine (a lipid with only one hy- 
drocarbon tail), and produce a new lipid. In a 
similar experiment performed previously, oleyl- 
CoA was coupled to lyso-phosphatylcholine, 
and production and growth of liposomes was 
observed (D. Deamer, unpublished results). 
Since higher concentrations of the lyso-lipid 
act as a detergent, a dynamic balance be- 
tween liposome formation and destruction will 
be achieved if a mixture of different hydrocar- 
bon chains is included. This, in turn, will in- 
troduce “natural selection” for the best com- 
ponents that can self-assemble into liposomes. 

The results, described so far, demonstrate 
how simple metabolic reactions can be coupled 


4 A Simplified Proton Pump 

The task of creating a simple proton pump 
built of molecules that could have existed in 
the protobiological milieu may be helped by 
an unlikely model: the M 2 protein from hu- 
man influenza virus. Four identical fragments 
of this protein, each only 25 amino acids long, 
fold into helices and aggregate to form a small, 
membrane-spanning channel. In response to an 
applied voltage, protons are conducted through 
a pore in the middle of the channel at remark- 
ably high rates [12]. It has been postulated 
that protons travel along the network of wa- 
ter molecules filling the pore of the channel. 
This mechanism, however, must involve an ad- 
ditional step because the channel contains four 
histidine amino acid residues, one from each 
of the helices, which are sufficiently large to 
occlude the pore and interrupt the water net- 
work. Thus, the histidine residues form a gate, 
which controls transport of ions. Two mech- 
anisms of gating have been proposed. In one 
mechanism, all four histidines acquire an ad- 
ditional proton and, due to repulsion between 
their positive charges, move away from one an- 
other, thus opening the channel [13]. In the 
alternative mechanism, one proton is captured 
on one side of a histidine residue while an- 
other proton is released from the opposite side. 
Then, the channel returns to the initial state 
through tautomerization [15]. 

Large-scale, atomic-level molecular dynam- 
ics simulations of the channel with the his- 
tidine residues in different protonation states 
[16] revealed that all intermediate states of 
the the tautomerization mechanism are struc- 
turally stable (see Fig. 2, right), and the ar- 
rangement of water molecules in the channel is 
conducive to the proton transport. In contrast, 
in the four-protonated state, postulated to ex- 
ist in the gate-opening mechanism, the electro- 
static repulsion between the histidine residues 
appears to be so large that the channel loses its 
structural integrity (see Fig. 2, left). This re- 
sult indicates that such a mechanism of proton 
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Figure 2: Richardson diagram of intermediate states of two mechanisms for proton 
translocation through the M 2 protein of human Influenza A. Histidines and water wire 
shown as dark and light CPK models, respectively. Solvating bilayer and waters omitted. 
Figure made with MoLSCRIPT [14] 


transport is unlikely. 

The results also suggest how the channel 
can be re-engineered to become a simple pro- 
ton pump. First, a proton source must be 
placed inside the channel. Among many pos- 
sible sources of protons, probably the simplest 
are polycyclic aromatic hydrocarbons [17] — 
a class of molecules readily available on the 
early earth. Once the channel is coupled with 
a proton source, the last thing that remains to 
be done is to ensure that protons transferred 
through the channel to the aqueous medium 
are unlikely to be used to regenerate the pro- 
ton source. Since the proposed proton pump 
is asymmetrical, this should be possible to ac- 
complish by appropriate genetic modification 
of the channel. 

5 In vitro Evolution of Novel 
Peptide Enzymes 

In considering the other functions performed 
by protocells, no suitable models from mod- 
em organisms are readily available. Previous 
attempts to design small, functional proteins 
from first principles have been less than suc- 
cessful. Instead, a novel method for the in 
vitro evolution of protein enzymes towards ar- 
bitrary catalytic targets will be employed. A 


similar approach has already been developed 
for nucleic acids: First, a very large popula- 
tion of candidate molecules is generated using 
a random synthetic approach. Next, the small 
number of molecules that can accomplish the 
desired task are selected. These molecules are 
then vastly multiplied using the polymerase 
chain reaction. A mutagenic approach, in 
which the sequences of selected molecules are 
randomly altered, can yield further improve- 
ments in performance or alterations of specifici- 
ties. Unfortunately, unlike nucleic acids, pro- 
teins cannot be directly amplified. In the new 
technique, this problem is circumvented by co- 
valently linking each protein of the initial, di- 
verse, pool to the RNA sequence that codes 
for it. Then, selection is performed on the 
proteins. The selected protein- nucleic acid fu- 
sion molecules are then cleaved and the nucleic 
adds amplified and, if desired, mutated. These 
nucleic adds are then translated into proteins, 
fused to these proteins and another round of 
selection is performed [18]. 

Using this technique, we will generate mod- 
els for several important proto-enzymes in- 
volved in the creation of new proteins. The 
formation of a peptide bond between amino 
acids requires the input of energy. Thus the 
first target of the new in vitro evolution proce- 
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Figure 3: Average catalytic efficiency of peptides in a test protocell simulated by the 
Inherited Efficiencies Model. Left panel shows the effect of decreasing ease of forming 
ligases (lower line) relative to a reference model (upper line). Right panel shows effect 
of decreasing preference for cutting bonds in unstructured peptides (lower line) relative 
to the same reference model (upper line). 


dure will be an enzyme that can use the ATP 
generated by the transduction system to phos- 
phorylate amino acids, thereby preparing them 
to be used to form new proteins. The second 
target for in vitro evolution will be am enzyme 
that can catalyze the formation of a peptide 
bond in the absence of a nucleic acid template, 
a peptide ligase. In modem biology, there are 
very few examples of such enzymes [19], and 
all are quite large and complex. 

6 Evolution without a genome 

Protocells must have been able not only to 
maintain themselves but also to evolve. Dar- 
winian evolution occurs through the accumu- 
lation of a series of small alterations to func- 
tioned molecules whose identities are stored. 
Protocells, however, may have been incapable 
of such storage. We hypothesize that under 
such conditions, the replication of functions 
and their inter-relationships, rather them the 
precise identities of the functional molecules, 
is sufficient for survival and evolution. This 
process is called non-genomic evolution. 

Central to this concept is the emergence 
of weak ligases, that would produce a di- 
verse pool of peptides. A few of the pep- 
tides so generated could have been better cata- 
lysts of peptide bond formation than the proto- 
enzymes that formed them, thereby generat- 
ing even more peptides and increasing the 
chances of producing functional ones. Some of 
these functional peptides would be proteases, 


proto-enzymes that cut peptide bonds. Since 
proteases cleave unstructured peptides more 
rapidly than structured ones, and since func- 
tional peptides have some degree of ordered 
structure, proteases would preferentially de- 
stroy non-functional peptides. Occasionally, 
the newly produced peptides would be capa- 
ble of performing novel functions. If these 
novel peptides integrated into the protocellular 
metabolism, they could increase the capabili- 
ties of the protocell. Eventually, this process 
could lead to the emergence (or utilization) of 
nucleic acids and their coupling with peptides 
into a genomic system. 

To examine the evolutionary potential of 
a non-genomic system, we have developed a 
simple, computationally tractable model that 
is still capable of capturing the essential bio- 
chemical features of the real system. In this 
model, the specific identities of amino acids 
are not considered, and the catalytic efficien- 
cies of all peptides of the same length are nor- 
mally distributed. As peptides grow longer, 
their mean efficiency also grows, but non- 
monotonically: only when peptides reach suf- 
ficient lengths to adopt ordered structures do 
the average efficiencies increase markedly. Pep- 
tides act as both catalysts and reaction sub- 
strates. In the simplest implementation of the 
model, only two catalyzed reactions were con- 
sidered: the formation (polymerization) and 
destruction (hydrolysis) of peptide bonds. The 
properties of the products of these reactions are 
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related to the property >f the reactants, and 
so the model is called t: inherited Efficiencies 

Model. 

Simulations of tliis del were carried out 
using a Monte Carlo v: !od [2d]. For many 

choices of model para. ers, the overall cat- 
alytic efficiency of the tocell w as observed 
to increase. Two pro; ies strongly affected 
the ability of the prot j to evolve: the bal- 
ance between the prcba 1 ;ties that a peptide is 
an efficient protease and i efficient ljgase, and 
the strength of the pre' .^nce toward the hy- 
drolysis of unstructured ptides. It was found 
that when efficient ligav were not difficult to 
form (Fig. 3, left pane; r when there was a 
strong preference for the . /drolysis of unstruc- 
tured peptides (Fig. 3 t ht panel), any long, 
highly efficient peptide* rmed would not be 
destroyed before they c i accelerate the rate 
at which new, and po6 y efficient, peptides 
were formed. Under th* ronditions, the abil- 
ity of the protocell to ir jve its catalytic ca- 
pabilities was high. Th esults demonstrate 

the possibility of a nc^ nechanism of early 
protocellular evolution does not require 

the presence of a genon ;:or any form of se- 
quence complementary the exact replica- 

tion of protein sequent- In fact, the sloppy 
replication of protein sc .ces is an advantage 

in the earliest phase of ation because it al- 
lows for the rapid exp!< on of the space of 

proteins and the discovt; f new functions. It 

is the preservation of t! - functions and their 
inter-relationships whj'. . ust be maintained 
during this early stage evolution, not the 
identity of the actors ; rming those func- 
tions. 
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